Acoustic event recognition using dominant spectral basis vectors

نویسندگان

  • Woohyun Choi
  • Sangwook Park
  • David K. Han
  • Hanseok Ko
چکیده

This paper proposes a novel filter bank composed of dominant Spectral Basis Vectors (SBVs) in a spectrogram. Spectral envelopes represented by the SBVs have shown to be excellent characteristic features for discriminating different acoustic events in noisy environment. Non-negative Matrix Factorization (NMF) and non-negative K-SVD (NKSVD) for part-based and holistic representations extract dominant SBVs from a spectrogram. The effectiveness of the proposed method is demonstrated on a database of real life recordings via experiments, and its robust performance is compared to conventional methods.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Combining pattern recognition and deep-learning-based algorithms to automatically detect commercial quadcopters using audio signals (Research Article)

Commercial quadcopters with many private, commercial, and public sector applications are a rapidly advancing technology. Currently, there is no guarantee to facilitate the safe operation of these devices in the community. Three different automatic commercial quadcopters identification methods are presented in this paper. Among these three techniques, two are based on deep neural networks in whi...

متن کامل

Heterogeneous acoustic measurements and multiple classifiers for speech recognition

The acoustic-phonetic modeling component of most current speech recognition systems calculates a small set of homogeneous frame-based measurements at a single, fixed time-frequency resolution. This thesis presents evidence indicating that recognition performance can be significantly improved through a contrasting approach using more detailed and more diverse acoustic measurements, which we refe...

متن کامل

Persian Phone Recognition Using Acoustic Landmarks and Neural Network-based variability compensation methods

Speech recognition is a subfield of artificial intelligence that develops technologies to convert speech utterance into transcription. So far, various methods such as hidden Markov models and artificial neural networks have been used to develop speech recognition systems. In most of these systems, the speech signal frames are processed uniformly, while the information is not evenly distributed ...

متن کامل

Minimum mean square error spectral peak envelope estimation for automatic vowel classification

Spectral feature computations continue to be a very difficult problem for accurate machine recognition of vowels especially in the presence of noise or for otherwise degraded acoustic signals. In this work, a new peak envelope method for vowel classification is developed, based on a missing frequency components model of speech recognition. According to this model, vowel recognition depends only...

متن کامل

A Robust Environmental Sound Recognition System using BPNN and RBFNN

Abstract— In a reverberant environment, the performance of acoustic event recognition system can be bolstered by choosing appropriate feature descriptors and classifier techniques. Neural networks are by far providing stunning classification results when compared to other classifiers. This paper analyses two different neural networks and their precision when they both stumble upon same targets ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015